# End-to-end speech processing

Ultravox V0 5 Llama 3 1 8b
MIT
A multilingual audio-to-text model based on Llama-3.1-8B-Instruct that supports more than 40 languages.
Large Language Model, Transformers, Supports Multiple Languages
FriendliAI
218
0
Wav2vec2 Nepali
A Nepali speech recognition model fine-tuned from Facebook's wav2vec2 model.
Speech Recognition, Transformers, Other
anish-shilpakar
312
1
Test Audio
MIT
A Transformer-based end-to-end speech translation model specifically designed for French-to-English speech translation tasks.
Speech Recognition, Transformers, Supports Multiple Languages
joaogante
19
0
Wav2vec2 Large Xlsr Turkish Demo
An XLSR-Wav2Vec2 speech recognition model fine-tuned on the Turkish Common Voice dataset, used primarily for Turkish speech-to-text.
Speech Recognition
patrickvonplaten
18
0
Wav2vec2 Xls R 1b 21 To En
Apache-2.0
Facebook's Wav2Vec2 XLS-R model for translating speech in multiple languages into English text.
Speech Recognition, Transformers, Supports Multiple Languages
facebook
511
3
Wav2vec2 Malayalam Stt
A Malayalam speech recognition model based on the Wav2Vec2 architecture that converts Malayalam speech into text.
Speech Recognition, Transformers
addy88
15
0
Wav2vec2 Large Xls R 300m Turkish Colab 4
Apache-2.0
A speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the Common Voice Turkish dataset.
Speech Recognition, Transformers
nimrah
20
0
Wav2vec2 Nepali Stt
A Nepali speech recognition model based on the Wav2Vec2 architecture that converts Nepali speech directly into text.
Speech Recognition, Transformers
addy88
23
1
S2t Small Covost2 En Fa St
MIT
A Transformer-based end-to-end speech translation model for English-to-Persian speech translation.
Speech Recognition, Transformers, Supports Multiple Languages
facebook
49
3
Wav2vec2 Xls R 2b En To 15
Apache-2.0
Facebook's Wav2Vec2 XLS-R model fine-tuned for speech translation, translating spoken English into text in 15 target languages.
Speech Recognition, Transformers, Supports Multiple Languages
facebook
27
1
Wav2vec2 Xls R 300m En To 15
Apache-2.0
Facebook's Wav2Vec2 XLS-R model fine-tuned for speech translation from English into 15 target languages.
Speech Recognition, Transformers, Supports Multiple Languages
facebook
167
6